CDS

Accession Number TCMCG033C23099
gbkey CDS
Protein Id TQD91291.1
Location 1115..2563
Organism Malus baccata
locus_tag C1H46_023103

Protein

Length 482 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA428857, BioSample:SAMN08323692
db_source VIEB01000424.1
Definition hypothetical protein C1H46_023103 [Malus baccata]
Locus_tag C1H46_023103
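
The CDS location and the protein length in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `expected_protein_length` is hypothetical, and it assumes a contiguous forward-strand location of the form `start..end` (no `join(...)` or `complement(...)`), 1-based inclusive coordinates, and a CDS that includes its terminal stop codon.

```python
def expected_protein_length(location: str) -> int:
    """Return the residue count implied by a simple 'start..end' CDS location.

    Assumptions (not stated in the record itself): the location is one
    contiguous span, coordinates are 1-based and inclusive, and the span
    ends with a stop codon that is not counted as a residue.
    """
    start_text, end_text = location.split("..")
    start, end = int(start_text), int(end_text)

    # Inclusive coordinates: both endpoints belong to the CDS.
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError(f"CDS length {nucleotides} is not a multiple of 3")

    # One codon per residue, minus the terminal stop codon.
    return nucleotides // 3 - 1


print(expected_protein_length("1115..2563"))  # prints 482
```

For the location 1115..2563 this gives 1449 nucleotides, or 483 codons, which matches the stated protein length of 482 residues once the stop codon is excluded.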

EGGNOG-MAPPER Annotation

COG_category CG
Description Belongs to the UDP-glycosyltransferase family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R07575
KEGG_rclass RC00005, RC00523
BRITE ko00000, ko00001, ko01000, ko01003
KEGG_ko ko:K14595
EC 2.4.1.263
KEGG_Pathway ko00906, ko01100, ko01110, map00906, map01100, map01110
GOs -

Sequence

CDS:  
ATGGATTCTTCAGAAACTCCAGTCGTCATGTACTTCTTCCCATTTGTAGGCGGAGGCCACCAGATTCCTATGATCGACATGGCCAGAGTCTTCTCCTCCCATGGAGCCAAAGTCACAATCCTAAGCACCACTCCCGCCGACGCCCTCCGCTTCCGAAACTCCATCCGCCGCGATCAAACCCTCAACCGCTCAATCACCATCCACGTCCTCAAGCTCCCAGGCGACGACGCCTCGGCCGACTCTTCCATGACCTCAGCCCCCCTCACTGACACCTCAGTCCTCCAAGAATCCCTTAGGCAATTCATCACCCAAAACCTACCCAATTGCATCGTAATCGACGTCTTCCACCGTTGGGCGGCACAAGTCATCGACGAGCTTTTCATCAAACGGGTGGTGTTCAATGGAAACGGGTTATTCTCCCGCTGCGTCAGTGAGTGTATCGGCCGATTCGCGCCGCATCAGAATGTGGGTTCTCATTGCGAGCCATTTCTAGTACCGAACTTGCCCGATCGGATCGAATTGACGAAATCTCAGCTGCCTTCTTTCGCAAGAAACAGGCCAGGGCTTCCTGATAAGGTGGGAAAAGCAGAGGAGAAGAGTTTTGGGGTTGTGGTGAACAGTTTTTACGAATTGGAGTCGAAATATGTGGAGTATTTCACGACTGAGTTGGGGAAGAAGGCATGGCCGATCGGCCCGGTTTCACTATACAACCGAAGCAACGACGATAAGACTGACAGAGGCCAAGCAGCCTTGGTCGATGAGCAGAGCCTCCTGCATTGTCTGAATTGGTTGGATTCCAAGGAACCTGCTTCGGTGGTTTATATCAGTTTTGGGAGCTTGGCTCGGTTGTCTGCAGCCCAACTCGTCGAAATCGCACATGGGATTGAATCTTCGGGGCATAATTTCGTTTGGGTAATCGGAAAAATCTTCAGAGCGGTGGAGGACGGTGGTTATGTTGGAGACAAAGAGGATTGGATTCCGGCGGGATTTGCAGAGAGAATGTGGGAAATGAAGAGAGGGGTTGTGATAGGTGGGTGGGCCCCGCAGATTCTGATACTGGAGCACTGCGCTGTTGGCGGGTTTGTGAGCCACTGCGGGTGGAACTCGACATTGGAGAGCGTGAGCGCAGGGGTGCCCATGGTGACTTGGCCATTGTCGGCGGAGCAGTTCTACAACGAGAAGCTGATAACTGATGTGTTGGGCATAGGGGTGCAAGTGGGGAGTAGGGAATGGGAGTCGTGGAATGTGGAGAGGAAGGAACTGGTGAGGAGAGAGAAGGTGGATGCGGCGGTGAGAAGGATGATGGGCGGCAGTGATGAGGCGGCGGAAATGAAGAAGAGAGCGAGAGCGCTTTCAGAGAAGGCGAAGAGAGCTGTGGAAGAAGGTGGGTCTTCATATGTAGGGGTGGATGCTCTGATTTTAGAGATCAGATTATCGAGGCAGAATTAG
Protein:  
MDSSETPVVMYFFPFVGGGHQIPMIDMARVFSSHGAKVTILSTTPADALRFRNSIRRDQTLNRSITIHVLKLPGDDASADSSMTSAPLTDTSVLQESLRQFITQNLPNCIVIDVFHRWAAQVIDELFIKRVVFNGNGLFSRCVSECIGRFAPHQNVGSHCEPFLVPNLPDRIELTKSQLPSFARNRPGLPDKVGKAEEKSFGVVVNSFYELESKYVEYFTTELGKKAWPIGPVSLYNRSNDDKTDRGQAALVDEQSLLHCLNWLDSKEPASVVYISFGSLARLSAAQLVEIAHGIESSGHNFVWVIGKIFRAVEDGGYVGDKEDWIPAGFAERMWEMKRGVVIGGWAPQILILEHCAVGGFVSHCGWNSTLESVSAGVPMVTWPLSAEQFYNEKLITDVLGIGVQVGSREWESWNVERKELVRREKVDAAVRRMMGGSDEAAEMKKRARALSEKAKRAVEEGGSSYVGVDALILEIRLSRQN